BTCC / BTCC Square / Global Cryptocurrency /
Nvidia and Researchers Unveil Smarter Training Method for Game-Playing AIs

Nvidia and Researchers Unveil Smarter Training Method for Game-Playing AIs

Published:
2025-06-19 13:13:01
6
3

Nvidia, alongside researchers from the Politehnica University of Bucharest and Mila Quebec AI Institute, has developed a breakthrough reinforcement learning technique called Macro-Action Similarity Penalty (MASP). This method accelerates AI training by identifying similarities between macro-actions—bundled sequences of decisions—enabling more efficient learning in complex environments like video games and robotics.

The MASP approach outperformed established benchmarks such as RAINBOW-DQN in game testing, demonstrating superior adaptability in titles like Breakout and Street Fighter II. While the technique shows promise for applications in autonomous systems and adaptive gaming AI, its computational overhead and dependency on well-designed action sets present implementation challenges.

|Square

Get the BTCC app to start your crypto journey

Get started today Scan to join our 100M+ users